<110> Tomikawa, Mayumi 
Aikawa, Seiichi 
Matsuzawa, Fumiko 

<120> Method and Apparatus for Extracting and Evaluating Mutually Similar 
Portions in One-Dimensional Sequences in Molecules and/or Three-Dimensional
Structures of Molecules 

<130> 522.1921D3 

<140> 09/909,809 
<141> 2001-07-23 

<150> US 08/014,867 
<151> 1993-02-08 

<160> 20 

<170> PatentIn version 3.1 (created in U.S. Serial No. 09/910,071)

<210> 1 

<211> 37

<212> PRT 

<213> human 

<400> 1 

Gly Asp Val Glu Lys Gly Lys Lys Ile Phe Ile Met Lys Cys Ser Gln
1 5 10 15



His Thr Val Glu Gly Gly Lys His Lys Thr Gly Pro Asn Leu His Gly 
20 25 30 



Leu Phe Gly Arg Lys 
35 



<210> 2 

<211> 39 

<212> PRT 

<213> bacterium 

<400> 2 

Glu Gly Asp Asp Ala Ala Ala Gly Glu Lys Val Ser Lys Lys Cys Leu 
1 5 10 15 



Ala Cys His Thr Phe Asp Gln Gly Gly Ala Asn Lys Val Gly Pro Asn
20 25 30 



Pro Asn Leu Phe Gly Val Phe
35



<210> 3

<211> 51

<212> PRT

<213> rat

<400> 3



Met Ser Leu Ala Ile Leu Arg Val Ile Arg Leu Val Arg Val Phe Arg
1 5 10 15

Ile Phe Lys Leu Ser Arg His Ser Lys Gly Leu Gln Ile Leu Gly Arg
20 25 30

Thr Leu Lys Ala Ser Met Arg Glu Leu Gly Leu Leu Ile Phe Phe Ile
35 40 45 



Gly Val Val 
50 



<210> 4 

<211> 142 

<212> PRT 

<213> unknown 

<220> 

<223> Figure 23A 
<400> 4 

Thr Glu Glu Gln Ile Ala Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp
1 5 10 15

Lys Asp Gly Asp Gly Thr Ile Thr Thr Lys Glu Leu Gly Thr Val Met
20 25 30

Arg Ser Leu Gly Gln Asn Pro Thr Glu Ala Glu Leu Gln Asp Met Ile
35 40 45

Asn Glu Val Asp Ala Asp Gly Asn Gly Thr Ile Asp Phe Pro Glu Phe
50 55 60

Leu Thr Met Met Ala Arg Lys Met Lys Asp Thr Ser Glu Glu Glu Ile
65 70 75 80

Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr Ile Ser
85 90 95

Ala Ala Glu Leu Arg His Val Met Thr Asn Leu Gly Glu Lys Leu Thr
100 105 110

Asp Glu Glu Val Asp Glu Met Ile Arg Glu Ala Asn Ile Asp Gly Asp
115 120 125

Gly Gln Val Asn Tyr Glu Glu Phe Val Gln Met Met Thr Ala
130 135 140 



<210> 5 

<211> 159 

<212> PRT 

<213> unknown

<220> 

<223> Figure 23B 
<400> 5

Ala Met Asp Gln Gln Ala Glu Ala Arg Ala Phe Leu Ser Glu Glu Met
1 5 10 15

Ile Ala Glu Phe Lys Ala Ala Phe Asp Met Phe Asp Ala Asp Gly Gly
20 25 30

Gly Asp Ile Ser Thr Lys Glu Leu Gly Thr Val Met Arg Met Leu Gly
35 40 45

Gln Asn Pro Thr Lys Glu Glu Leu Asp Ala Ile Ile Glu Glu Val Asp
50 55 60

Glu Asp Gly Ser Gly Thr Ile Asp Phe Glu Glu Phe Leu Val Met Val
65 70 75 80

Arg Gln Met Lys Glu Asp Ala Lys Gly Lys Ser Glu Glu Glu Leu Ala
85 90 95

Asp Cys Phe Arg Ile Phe Asp Lys Asn Ala Asp Gly Phe Ile Asp Ile
100 105 110

Glu Glu Leu Gly Glu Ile Leu Arg Ala Thr Gly Glu His Val Thr Glu
115 120 125

Glu Asp Ile Glu Asp Leu Met Lys Asp Ser Asp Lys Asn Asn Asp Gly
130 135 140

Arg Ile Asp Phe Asp Glu Phe Leu Lys Met Met Glu Gly Val Gln
145 150 155

<210> 6

<211> 28

<212> PRT

<213> unknown

<220>

<223> calmodulin probe site 81-108, Figure 25, target

<400> 6

Leu Ala Asp Cys Phe Arg Ile Phe Asp Lys Asn Ala Asp Gly Phe Ile
1 5 10 15

Asp Ile Glu Glu Leu Gly Glu Ile Leu Arg Ala Thr
20 25

<210> 7

<211> 28

<212> PRT

<213> unknown

<220>

<223> calmodulin probe site 81-108, probe, Figure 25

<400> 7

Ile Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr Ile
1 5 10 15



Ser Ala Ala Glu Leu Arg His Val Met Thr Asn Leu 
20 25 



<210> 8 

<211> 27 

<212> PRT 

<213> unknown 

<220> 

<223> Calmodulin probe site 117-143, Figure 26, target 

<400> 8 



Ile Glu Asp Leu Met Lys Asp Ser Asp Lys Asn Asn Asp Gly Arg Ile
1 5 10 15 



Asp Phe Asp Glu Phe Leu Lys Met Met Glu Gly 
20 25 



<210> 9 

<211> 27 

<212> PRT 

<213> unknown

<400> 9 

Val Asp Glu Met Ile Arg Glu Ala Asn Ile Asp Gly Asp Gly Gln Val
1 5 10 15

Asn Tyr Glu Glu Phe Val Gln Met Met Thr Ala
20 25 



<210> 10 

<211> 8 

<212> PRT 

<213> unknown 

<220> 

<223> Coordinates of C-alpha corresponding to the amino acid residue
Nos. 7 to 14 in elongation factor of protein which is a binding
site for phosphoric acid of GTP

<400> 10 

Gly His Val Asp His Gly Lys Thr
1 5 



<210> 11 

<211> 8 

<212> PRT 

<213> unknown 

<220> 

<223> adenylate kinase - target 

<400> 11 

Gly Ala Pro Gly Ser Gly Lys Gly 
1 5 



<210> 12 

<211> 8 

<212> PRT 

<213> unknown 

<220> 

<223> ras protein - target 



<400> 12 



Gly Ala Gly Gly Val Gly Lys Ser
1 5

<210> 13

<211> 8

<212> PRT

<213> unknown

<220>

<223> adenylate kinase (3ADK)

<400> 13

Gly Gly Pro Gly Ser Gly Lys Gly
1 5

<210> 14

<211> 223

<212> PRT

<213> unknown

<220>

<223> amino acid sequence of trypsin

<400> 14



Ile Val Gly Gly Tyr Thr Cys Cys Ala Asn Thr Val Pro Tyr Gln Val
1 5 10 15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Ser
20 25 30

Gln Trp Val Val Ser Ala Ala His Cys Tyr Lys Ser Gly Ile Gln Val
35 40 45

Arg Leu Gly Glu Asp Asn Ile Asn Val Val Glu Gly Asn Glu Gln Phe
50 55 60

Ile Ser Ala Ser Lys Ser Ile Val His Pro Ser Tyr Asn Ser Asn Thr
65 70 75 80

Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Lys Ser Ala Ala Ser Leu
85 90 95

Asn Ser Arg Val Ala Ser Ile Ser Leu Pro Thr Ser Cys Ala Ser Ala
100 105 110

Gly Thr Gln Cys Leu Ile Ser Gly Trp Gly Asn Thr Lys Ser Ser Gly
115 120 125

Thr Ser Tyr Pro Asp Val Leu Lys Cys Leu Lys Ala Pro Ile Leu Ser
130 135 140

Asp Ser Ser Cys Lys Ser Ala Tyr Pro Gly Gln Ile Thr Ser Asn Met
145 150 155 160

Phe Cys Ala Gly Tyr Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp
165 170 175

Ser Gly Gly Pro Val Val Cys Ser Gly Lys Leu Gln Gly Ile Val Ser
180 185 190

Trp Gly Ser Gly Cys Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys
195 200 205

Val Cys Asn Tyr Val Ser Trp Ile Lys Gln Thr Ile Ala Ser Asn
210 215 220 



<210> 15 

<211> 239 

<212> PRT 

<213> unknown 

<220> 

<223> amino acid sequence of Elastase 

<400> 15 

Val Val Gly Gly Thr Glu Ala Gln Arg Asn Ser Trp Pro Ser Gln Ile
1 5 10 15

Ser Leu Gln Tyr Arg Ser Gly Ser Ser Trp Ala His Thr Cys Gly Gly
20 25 30

Thr Leu Ile Arg Gln Asn Trp Val Met Thr Ala Ala His Cys Val Asp
35 40 45

Arg Glu Leu Thr Phe Arg Val Val Val Gly Glu His Asn Leu Asn Gln
50 55 60

Asn Asn Gly Thr Glu Gln Tyr Val Gly Val Gln Lys Ile Val Val Pro
65 70 75 80

Tyr Trp Asn Thr Asp Asp Val Ala Ala Gly Tyr Asp Ile Ala Leu Leu
85 90 95

Arg Leu Ala Gln Ser Val Thr Leu Asn Ser Tyr Val Gln Leu Gly Val
100 105 110

Leu Pro Arg Ala Gly Thr Ile Leu Ala Asn Ser Pro Cys Tyr Ile Thr
115 120 125

Thr Gly Trp Gly Leu Thr Arg Thr Asn Gly Gln Leu Ala Gln Thr Leu
130 135 140

Gln Gln Ala Tyr Leu Pro Thr Val Asp Tyr Ala Ile Cys Ser Ser Ser
145 150 155 160

Ser Tyr Trp Gly Ser Thr Val Lys Asn Ser Met Val Cys Ala Gly Gly
165 170 175

Asp Gly Val Arg Ser Gly Cys Gln Gly Asp Ser Gly Gly Pro Leu His
180 185 190

Cys Leu Val Asn Gly Gln Tyr Ala Val His Gly Val Thr Ser Phe Val
195 200 205

Ser Arg Leu Gly Cys Asn Val Thr Arg Lys Pro Thr Val Phe Thr Arg
210 215 220

Val Ser Ala Tyr Ile Ser Trp Ile Asn Asn Val Ile Ala Ser Asn
225 230 235 



<210> 16 

<211> 6 

<212> PRT 

<213> unknown 

<220> 

<223> histidine active sites (41-46) of elastase 

<400> 16 



Met Thr Ala Ala His Cys 
1 5 



<210> 17 

<211> 6 

<212> PRT 

<213> unknown 

<220> 

<223> histidine active sites (36-41) of trypsin 

<400> 17 

Val Ser Ala Ala His Cys 
1 5 



<210> 18 

<211> 5 

<212> PRT 

<213> unknown 

<220> 

<223> serine active sites of elastase 

<400> 18 

Gly Asp Ser Gly Gly
1 5



<210> 19 

<211> 5 

<212> PRT 

<213> unknown 

<220> 

<223> serine active sites (175-179) of trypsin 

<400> 19 

Gly Asp Ser Gly Gly 
1 5 



<210> 20 
<211> 120 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Figure 46 - examples of outputs protein A and protein B 
<400> 20 

Ala Thr Tyr Arg Leu Lys Asn Gly Asp Asp Pro Leu Ser Asp His Lys
1 5 10 15

Leu Ser Asp Phe Gly Phe Ser Gly Lys Ile Leu Thr Arg Tyr Asn Gln
20 25 30

Ala Val Gly Asp Asp Pro Leu Asp Gly Ala Arg Ser Asp Phe Gly Phe
35 40 45

Ser Gly Lys Ile Pro Leu Thr Arg Tyr Asn Val Gln Asn Asp Glu Ser
50 55 60

Gly Phe Arg Thr Lys His Gly Leu Asp Ala Ser Asn Ala Ala Thr Tyr
65 70 75 80

Arg Leu Lys Asn Asp Asp Pro Leu Asp Gly Ala Arg Glu Lys Leu Ser
85 90 95

Asp Phe Gly Ala His Phe Ser Gly Lys Ile Phe Thr Arg Tyr Asn Gln
100 105 110

Ala Val Gln Asn Asp Glu Ser Lys
115 120



